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Saxon XQuerv and XSLT Processor API - Tokenizer 
static int, QMARK "?" symbol, static int, RCURLY "}" symbol (XQuery only), boolean, 
recognizePragmas ... QMARK. public static final int QMARK. "?" symbol ... 
www.stylusstudio.com/api/ saxon8/net/sf/saxon/expr/Tokenizer.htm - 88k - Cached - Similar pages 

Saxon XQuery and XSLT Processor API - Constant Field Values 
public static final int, QMARK, 1 1 3. public static final int, RCURLY, 1 1 5. 
public static final int ... String, XSLT, "http://www.w3.org/1999/XSL/Transform" ... 
www.stylusstudio.com/api/saxon8/constant-values.htm - 304k - Cached - Similar pages 

Package oracle. xm I. parser. v2 

Table 11-4 Summary of XSLT Processor Classes in package oracle.xml.parserv2 ... 
QMARK. public static final int QMARK. ContentModelParseTreeNode type - "? ... 

www.cs.umb.edu/cs634/ora9idocs/ appdev.920/a96609/arj_xmlparserv2.htm - 513k - Cached - Similar pages 
Oracle XML Parser - Readme 

Version 2 of the XML Parser for Java, besides incorporating an XSLT processor, 
... QMARK, na, Implemented in v2.0.1 for tokenizer feature ... 

https://wwwforms.murdoch.edu.au/ xsql/java/parser/readme.html - 30k - Oct 26, 2005 - Cached - Similar pages 
Oracle XML Parser - Readme 

Note that there are several changes between April99 XSLT draft and the August99 
XSLT/Xpath ... The Node types are: PLUS, COMMA, ASTERISK, ELEMENT, QMARK ). ... 
https://ereg.bir.gov.ph/xdk/java/parser/readme.html - 38k - Oct 26, 2005 - Cached - Similar pages 

[ps] Automating the Transformation of XML Documents Hong SuComputer ... 
File Format: Adobe PostScript - View as Text 

The operation sequence then is used to generate an equivalent XSLT ... called qmark 
quantifier node). A tree rooted at a node of element type T is called T ... 
www.cs.wpi.edu/-suhong/research/PAPERS/xtra.ps - Similar pages 

Re: [DB-SIG] Re: module name :: ASPN Mail Archive :: db-sig 
ActiveState Open Source Programming tools for Perl Python XML xslt ... Also note 
that I used the 'qmark' paramstyle even though psycopg uses 'pyformat'. ... 
aspn.activestate.com/ASPN/Mail/Message/db-sig/2169978 - 20k - Cached - Similar pages 

Re: [DB-SIG] Table Oriented Programming :: ASPN Mail Archive :: db-sig 
ActiveState Open Source Programming tools for Perl Python XML xslt scripting ... 
With module. paramstyle = "qmark", you can do something like this: query2 ... 
aspn.activestate.com/ASPN/MaiI/Message/db-sig/2478147 - 23k - Cached - Similar pages 
[ More results from aspn.activestate.com ] 

From raymond at caret.cam.ac.uk Tue Mar 4 12:12:42 2003 From ... 
John John Kleeman MA MBCS C.Eng (john.kleeman@qmark.co.uk) Questionmark chairman 
... you have the option of using any one of a number of technologies (XSLT, ... 
lists.ucles.org.uk/public/ims-qti/2003-March.txt - 129k - Cached - Similar pages 
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Hong Su 




Department of Computer Science 
Worcester Polytechnic Institute (WPI) 
100 Institute Road 
Worcester, MA 01609 

Office: FL319 

Phone: +1-508-831-5857 (office) 
Fax: +1-508-831-5776 
E-Mail: suhong@cs.wpi.edu 



I am a Ph.D. candidate in the Department of Computer Science at Worcester Polytechnic 
Institute , Worcester, Massachusetts. I am interested in research and development issues 
related to data and information management systems, including (1) XML based web data 
management (2) web publish/subscribe systems (3) continuous queries and (4) translation 
technology across heterogeneous data sources. My supervisor is Professor Elke A. 
Rundensteiner . 

[ Research Experience ] [ Intern Experience ] [ Publications ] [ Useful Links ] 



Research Experience: 

• XML stream based web publishing and subscription system 

• XTra: XML Document Automatic Translation XTra is a prototype system discovering 
the mappings between two DTDs and translating an XML document conforming to the 
source DTD to an XML document in the format of target DTD. We model DTD as ordered 
trees. We propose a set of transformation operations and define a cost model. By 
discovering a sequence of the operations that transform the source DTD tree to target 
DTD tree, two DTDs are associated with each other. And then according to the DTD 
transformation, we perform corresponding data transformation (XSLT) on the source 
XML document to achieve the XML document transformation. 

• XEM: XML Evolution Management XEM is a middle-layer tool providing flexible 
support for both DTD and XML data evolution. We implemented a prototype that provides 
a set of valid, sound and complete DTD and XML evolution two DTDs primitives. And we 
provide mechanisms to do constraint checking to ensure the structural consistency that 
the XML data always conforms to DTD. Also, we provide the user the flexibility to a 
describe arbitrary transformations by composing primitives via an XML query language. 
We use PSE Pro, a OODB product by Excelon Inc to serve as the underlying data 
storage system. 

• Re-Web - OODB-XML Data Mapping and Its Applications in E-Commerce Re-Web 
is a web site management tool focusing on the issue of reusable view generation 
templates at the content level. Exploiting the modeling power of the 00 model, as part of 



http://web.cs.wpi.edu/-suhong/ 



10/28/05 



Page 2 of 4 



our approach we have defined web semantics that allow us to map between the 
XML/HTML constructs and the ODMG object model constructs. We use these to do both 
the translation of the web to the database as well as the generation of the web pages 
from the database. At the database level we use a flexible and extensible re-structuring 
facility, SERF, to support complex re-structuring, information integration, and the creation 
of new view schemas. 

Intern and Co-op Experience: 

• IBM Toronto Laboratory, Toronto, Canada, IBM Cooperative Fellowship Recipient for 
year 2002-2003. 

IBM Cooperative Fellowship Student Summer Intern in IBM DB2 Query Rewrite 
Development Team. 
Jun. 2002 - Aug. 2002 

Activities include investigating using XML Automatic Summary Tables (XML-ASTs) to 
answer the incoming XQueries on the virtual XML views of relational tables. 

• IBM Toronto Laboratory, Toronto, Canada, IBM Cooperative Fellowship Recipient for 
year 2001 -2002. 

IBM Cooperative Fellowship Student Summer Intern in IBM DB2 Universal Database 
Performance and Advanced Technology Team. 
Jun. 2001 - Aug. 2002 

Activities include cooperating with DB2 Performance and DB2 XML Extender team in 
exploring the performance bottleneck of decomposing XML documents into DB2. We 
analyze specific user cases and report the bugs in XML Extender to XML Extender team. 
In order to handle the constraint violation during the decomposition of XML documents 
into relational tuples as well as to improve the decomposition performance, we have 
proposed an approach that (1 ) utilizes LOAD, a DB2 data movement utility which is 
capable of efficiently moving large quantities of data into tables much faster than regular 
SQL INSERT as current Extender does; (2) provides options for handling constraint 
violation checking which both ensures a valid state of the relational tables storing 
decomposed XML documents and gives users the flexibility to handle errors. 

• HP Laboratories, Palo Alto, California 

Seed Research Intern (Grass Roots Breakthrough Program) in Software Lab. 
Sep. 2000 - Mar. 2001 

Research activities include survey on topics on general schema mapping, translation and 
investigation of automatic mapping and translation between e-business XML documents 
conforming to different DTDs. We also implemented a prototype system, XTra, to 
discover the mappings between two DTD models and then translates XML documents in 
source DTD format to target DTD format. We have filed two patents on this work, i.e., 
"Method and System for Discovery of Transformation Operations between two Extensible 
Markup Language (XML) Schemas" and "Method and System for Extensible Markup 
Language (XML) Schemas". 

• IBM TJ. Watson Research Center, Hawthorne, New York 
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Summer Intern in Parallel Database Department. 
May. 2000 - Aug. 2000 

Activities include doing survey on topics related to publishing XML-format data from 
underlying data sources that retail non-XML format data, understanding XML Access 
Server prototype that has been built for managing componentized, virtual XML 
documents, demonstrated the core module of XAS system - DTD source Annotation 
framework (DTDSA) on IBM Expo2000 in July, studying the issues of schema integration 
when using XAS system to store XML documents conforming to slightly different DTDs. 

Publications (see also the DSRG-List): 

Conference Papers: 

• Su, H., Rundensteiner, E. and Mani, M. 

"Semantic Query Optimization on XQuery over XML Streams " (.pdf) 
VLDB, Trondheim, Norway, Aug. 2005 

• Su, H., Rundensteiner, E. and Mani, M. 

"Raindrop: An XQuery Engine Over XML Streams - on Semantic Query Optimization " , 

um 

VLDB Demo, Toronto, ON, Canada, Sep. 2004 

• Su, H., Jian, J. and Rundensteiner, E. 

"RAINDROP: A Uniform and Layered Algebraic Framework for XQueries on 
XMLStreams", 

Twelfth International Conference on Information and Knowledge Management CIKM 
2003, New Orleans, LA, Nov. 2003 (.pdf) 

• Jian, J., Su, H. and Rundensteiner, E. 

"Automaton Meets Query Algebra: Towards A Unified Model for XQuery Evaluation over 
XML Data Streams" (Best Paper Nominee), 

22nd International Conference on Conceptual Modeling (ER), Chicago, IL, Oct. 2003 

urn 

• Kane, B., Su, H. and Rundensteiner, E. 

"Consistently Updating XML Documents Using Incremental Constraint Check Queries", 
4th International workshop on web information and data management (WIDM'02), 
McLean, Virginia, Nov. 2002 

• Su, H., Kane, B., Chen, C, Diep, Cuong., Guan, D., Look, J. and Rundensteiner, E. 

"A lightweight Constraint Check Framework for Consistently Updating XML Documents", 
2nd International Workshop on Evolution and Change in Data Management (ECDM , 02), 
Tampere, Finland, Oct. 2002 

• Su, H., Kuno, H. and Rundensteiner E. 
"Automating the translation of XML documents" 

3rd international workshop on web information and data management (WIDM'01), 
Atlanta, Georgia, Nov. 2001 (.ps) 
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• Su, H., Padmanabhan, S. and Lo, M-L 

"Identification of Syntactically Similar DTD Elements for Schema Matching" 
The Second International Conference on Web-Age Information Management 
(WAIM'2001 ), Xi'an, China, July. 2001 . (jds) 

• Claypool, K, Rundensteiner, E. A. , Zhang, X., Su, H., Kuno, H., Li, W-C. and Mitchell, G. 
"Model Management - A solution to Support Multiple Data Models, Their Mappings and 
Maintenance" 

ACM SIGMOD'01 Industrial Demo, Santa Barbara, California, May. 2001. (.ps) 

• Su, H., Kramer, D., Chen L, Claypool, K. and Rundensteiner, E. A. 
"XEM: Managing the evolution of XML Documents" 

Eleventh International Workshop on Research Issues in Data Engineering (RIDE 
2001), Heidelberg, Germany, April 1-2, 2001, (.ps) 

• Rundensteiner, E. A., Claypool, K. T., Chen, L., Su, H. and Oenoki, K. 
"SERFing the Web: Web-Site Management Made Easy" 

SIGMOD 2000, Software system demonstration paper, Dallas, Texas, USA, May 2000, 
pp. 585. (.ps) 

• Su, H. } Claypool, K. and Rundensteiner, E. A. 

"Extending the Object Query Language for Transparent Metadata Access", 

Ninth International Workshop on Foundations of Models and languages for Data and 

Objects (DEMM'00), Schloss Dagstuhl, Germany, Oct. 2000. (.ps) 

Demos: 

• Su, H., Kuno, H. and Rundensteiner, E. A. 
"XEM", 

Poster and System Demo, IBM Center for Advanced Studies Conferenc (CASCON'00), 
Toronto, Canada, Nov. 2000 

• Su, H., Kuno, H. and Rundensteiner, E. A. 

"XTra: Automating E-Service XML Document Translation", 

Poster and System Demo, IBM Center for Advanced Studies Conferenc (CASCON'01 ), 
Toronto, Canada, Nov. 2001 
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ps] Automating the Transformation of XML Documents Hong SuComputer ... 
File Format: Adobe PostScript - View as Text 

The operation sequence then is used to generate an equivalent XSLT transformation 
script. ... [15] H. Su, H. Kuno, and EA Rundensteiner. Automating ... 
www.cs.wpi.edu/~suhong/research/PAPERS/xtra.ps - Similar pages 

[pdf] Automating Transformation of Automating Transformation of XML ... 
File Format: PDF/Adobe Acrobat - View as HTML 

XML Documents. Hong Su*, Harumi Kuno**,. Elke A. Rundensteiner* ... Fold (<T1, 
T2, Ti>, T). Split (n, <n1, n2>). Relabel (n, I, I'). Delete (T) ... 

vyww.cs.wpi.edu/-suhong/research/PAPERS/widmtalk.pdf - Supplemental Result - Similar pages 

@string{acm = "Association for Computing Machinery 11 } @string{acmcs ... 
@inproceedings{smx_oois98, author = {EA Rundensteiner and HA Kuno and J. Zhou}, 
... AUTHOR = "Hong Su and Harumi Kuno and Elke A. Rundensteiner", ... 

davis.wpi.edu/-dsrg/rainbow/RainbowCore/ release/Doc/SystemDescription/web_db.bib - 149k- Supplemental 
Result - Cached - Similar pages 

wotan.liu.edu/docis/dbl/hicssh/index.html 
1432k - Supplemental Result - Cached - Similar pages 

[pdf] w\AAA/.inf.ufrgs.br/-kroth/pub/Query%20containment/li... 
File Format: PDF/Adobe Acrobat - View as HTML 
Supplemental Result - Similar pages 

[pdf] statistic.gunadarma.ac.id/pub/books/Ontology/web-s... 
File Format: PDF/Adobe Acrobat - View as HTML 
Supplemental Result - Similar pages 
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41 What makes the differences: benchmarking XML database implementations 

#Hongjun Lu, Jeffrey Xu Yu, Guoren Wang, Shihui Zheng, Haifeng Jiang, Ge Yu, Aoying Zhou 
February 2005 ACM Transactions on Internet Technology (TOIT), Volume 5 issue l 
Publisher: ACM Press 

Full text available: pdf(589.14 KB) Additional Information: full citation , abstract , references , index terms 

XML is emerging as a major standard for representing data on the World Wide Web. 
Recently, many XML storage models have been proposed to manage XML data. In order 
to assess an XML database's abilities to deal with XML queries, several benchmarks have 
also been proposed, including XMark and XMach. However, no reported studies using 
those benchmarks were found that can provide users with insights on the impacts of a 
variety of storage models on XML query performance. In this article, we report our ... 

Keywords: XML query processing, XML storage model, benchmark 

42 Accelerating XPath evaluation in any RDBMS 
<it> Torsten Grust, Maurice Van Keulen, Jens Teubner 

March 2004 ACM Transactions on Database Systems (TODS), volume 29 issue l 
Publisher: ACM Press 

Full text available: ^pdf(781.01 KB) Additional Information: full citation , abstract , references , index terms 

This article is a proposal for a database index structure, the XPath accelerator, that has 
been specifically designed to support the evaluation of XPath path expressions. As such, 
the index is capable to support all XPath axes (including ancestor, following, preceding- 
sibling, descendant-or-self, etc.). This feature lets the index stand out among related 
work on XML indexing structures which had a focus on the child and descendant axes 
only. The index has been designed with a close ... 



Keywords: Main-memory databases, XML, XML indexing, XPath 



43 



Human-computer interaction: A visual approach to define XML to FO transformations 
Gerardo Canfora, Luigi Cerulo 

July 2002 Proceedings of the 14th international conference on Software engineering 
and knowledge engineering SEKE '02 

Publisher: ACM Press 

Full text available: f g\ pdf(307.62 KB) Additional Information: full citation , abstract , references , index terms 
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Static analysis of XSLT programs 
Ce Dong, James Bailey 

January 2004 Proceedings of the fifteenth conference on Australasian database - 
Volume 27 CRPIT '04 

Publisher: Australian Computer Society, Inc. 

Full text available: ^ pdf(370.93 KB) Additional Information: full citation , abstract , references 

XML is becoming the dominant standard for representing and exchanging data on the 
World Wide Web. The ability to transform and present data in XML is crucial and XSLT 
(Extensible Stylesheet Language Transformations) is the principal programming language 
that supports this activity. Methods for analysis of XSLT programs are currently an 
important open issue. In this paper, we discuss new methods for analysing XSLT 
programs, which return information about reachability, invalid calling relations ... 



Keywords: XML, XSLT, termination analysis 



2 Session 4: Behavior3D: an XML-based framework for 3D graphics behavior 
Raimund Dachselt, Enrico Rukzio 

March 2003 Proceeding of the eighth international conference on 3D Web technology 
Publisher: ACM Press 

Full text available: "g| pdf(3.45 MB) Additional Information: full citation , abstract , references , index terms 

Success of 3D applications on the Web inherently depends on object behavior and 
interaction. Current Web3D formats often fall short in supporting behavior modeling. This 
paper introduces a flexible concept for declaratively modeling 3D object behaviors. Based 
on Extensible 3D (X3D) a node concept is suggested with object-oriented features such as 
inheritance, strong typing, and polymorphism. An XML-based language Behavior3DNode 
serves the interface definition of new nodes. Their implement ... 

Keywords: Contigra, Extensible 3D (X3D), SMIL, XML-Schema, animation, behavior 
language, dynamic grammar, object behaviors 



3 XML query and programming languages: Sub-document queries over XML with 
% XSQirrel 

Arnaud Sahuguet, Bogdan Alexe 

May 2005 Proceedings of the 14th international conference on World Wide Web 
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1 Document querying and transformation: A three-way merge for XML documents 
#> Ta ncred Lindholm 

▼ October 2004 Proceedings of the 2004 ACM symposium on Document engineering 
Publisher: ACM Press 

Full text available: ^ pdf(500.99 KB) Additional Information: full citation , abstract , references , index terms 

Three-way merging is a technique that may be employed for reintegrating changes to a 
document in cases where multiple independently modified copies have been made. While 
tools for three-way merge of ASCII text files exist in the form of the ubiquitous diff and 
patch tools these are of limited applicability to XML documents. 

We present a method for three-way merging of XML which is targeted at merging XML 
formats that model human-authored documents as ordered trees (e.g. rich text forma ... 



Keywords: XML, collaborative editing, conflict, structured text, three-way merge 



2 Research session: data warehousing and archive: Archiving scientific data 

# Peter Buneman, Sanjeev Khanna, Keishi Tajima, Wang-Chiew Tan 
June 2002 Proceedings of the 2002 ACM SIGMOD international conference on 

Management of data 
Publisher: ACM Press 

Full text available: f flpdf(1.27 MB) Additional Information: full citation , abstract , references , citings , index 

terms 

We present an archiving technique for hierarchical data with key structure. Our approach 
is based on the notion of timestamps whereby an element appearing in multiple versions 
of the database is stored only once along with a compact description of versions in which 
it appears. The basic idea of timestamping was discovered by Driscoll et. al. in the context 
of persistent data structures where one wishes to track the sequences of changes made to 
a data structure. We extend this idea to deve ... 

3 XML retrieval: Configurable indexing and ranking for XML information retrieval 
Shaorong Liu, Qinghua Zou, Wesley W. Chu 

>r July 2004 Proceedings of the 27th annual international ACM SIGIR conference on 
Research and development in information retrieval SIGIR '04 
Publisher: ACM Press 

Full text available' Additional Information: full citation , abstract , references , citings , index 



http://portal.acm.org/resul^ 10/28/05 



'Results (page 6): +node +join +xml 



Page 1 of 7 




RTAL 



USPTO 



Subscribe (Full Service) Register (Limited Service, Free) Login 

Search: ® The ACM Digital Library O The Guide 

I j > 

j +node +join +xml f ^fe$* 



"** m < r>h ^^Mmm^^m^ 8 Feedba °k Report a problem Satisfaction 

' " vmmmm • survey 

Terms used node join xml Found 707 of 1 66,357 

Sort results r re , evan ce iPI ^ Save results to a Binder Try an Advanced Search 

by ' ^ ftf ^ Try this search in The ACM Guide 

Dj , p^^ ^ ^^^^^. ^ Search Tips 

results" |expandedform:ii Q Qpen resu|ts ,„ a new 

window 

Results 101 - 120 of 200 Result page: previous 12345678910 next 

Best 200 shown Relevance scale GQiiH 

101 Paper session 4: XML query processing: Twig query processing over graph- 
structured XML data 

Zografoula Vagena, Mirella M. Moro, Vassilis J. Tsotras 

June 2004 Proceedings of the 7th International Workshop on the Web and 
Databases: colocated with ACM SIGMOD/PODS 2004 WebDB '04 

Publisher: ACM Press 

Full text available: B || pdf(522.10 KB) Additional Information: full citation , abstract , references , index terms 

XML and semi-structured data is usually modeled using graph structures. Structural 
summaries, which have been proposed to speedup XML query processing have graph 
forms as well. The existent approaches for evaluating queries over tree structured data 
(i.e. data whose underlying structure is a tree) are not directly applicable when the data 
is modeled as a random graph. Moreover, they cannot be applied when structural 
summaries are employed and, to the best of our knowledge, no analo ... 



102 Research session: XML query processing #2: FiST: scalable XML document filtering |j| 
by sequencing twig patterns 
Joonho Kwon, Praveen Rao, Bongki Moon, Sukho Lee 

August 2005 Proceedings of the 31st international conference on Very large data 
bases VLDB 05 

Publisher: VLDB Endowment 

Full text available: g pdf(359.93 KB) Additional Information: full citation , abstract , references , index terms 

In recent years, publish-subscribe (pub-sub) systems based on XML document filtering 
have received much attention. In a typical pub-sub system, subscribed users specify their 
interest in profiles expressed in the XPath language, and each new content is matched 
against the user profiles so that the content is delivered to only the interested 
subscribers. As the number of subscribed users and their profiles can grow very large, the 
scalability of the system is critical to the success of pub-sub se ... 



103 Extending Java for high-level Web service construction 
Aske Simon Christensen, Anders M0ller, Michael I. Schwartzbach 
November 2003 ACM Transactions on Programming Languages and Systems 

(TOPLAS), Volume 25 Issue 6 

Publisher: ACM Press 

Full text available: f& pdf(947.02 KB) Additional Information: full citation, abstract , references , citings, index 
u^—^ : terms 

We incorporate innovations from the <bigwig> project into the Java language to provide 
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